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Image object processmg 



FIELD OF THE INVENTION 

The inventioii relates to a method and apparatus for object processing for at 
least one image. 

5 BACKGROUND OF THE INVENTION 

Conventional video and TV systems distribute video signals which inherently 
are two dimensional (2D) in nature. However, it would in many qyplications be desirable to 
further provide three dimensional (3D) information. For example, 3D ioformalion nmy be 
used for enhancing object grasping and video compression for video signals. 

10 In particular three dimensional video or television (3DTV) is promising as a 

means for enhancing the user experience of the presentation of visual content, and 3DTV 
could potentially be as significant as the introduction of colour TV. 

The most commercially interesting 3DTV systems are based on re-use of 
existing 2D video infrastmcture liiereby allowing for a minimal cost and compatibility 

IS problems associated with a gradual roll out. For these systems, 2D video is distributed and is 
converted to 3D video at the location of the consumer. 

The 2D-to-3D conversion process adds (depth) structure to 2D video and may 
also be used for video compression. However, the conversion of 2D video into video 
comprising 3D information is a major image processing challenge. Consequently, significant 

20 research has been undertaking in this area and anumber of algorithnos and q[»proaches have 
been suggested for extracting 3D information fix>m 2D images. 

Known methods for derivmg depth or occlusion relations from monoscopic 
video comprise the structure fiom motion approach and the dynamic occlusion approach. 

In the structure fimn motion ^iproach, points of an object are tracked as the 

25 object moves and are used to derive a 3D model of the object. The 3D model is determined as 
tiiat which would most closely result in the observed movement of the tracked points. The 
dynamic occlusion ^yproach utilises the &ct Ifaat as diffisrent obj^^ts move within the picture, 
the occlusion (i,e. the overlg^ of one object over anoth^ in a 2D picture) provides 
information indicative of the relative depth of the objects. 
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However, structure from motion requires fhe presence of camera motion and 
cannot deal with independently moving objects (non-static scene). Furthermore, both 
approaches rely on the existence of moving objects and feil in situations where there is very 
littie or no apparent motion in the video sequence. 
5 Methods for deriving depth information based on static characteristics have 

been suggested. 

A depth cue which may provide static information is a T-junction 
corresponding to an intersection between objects. However, although the possibility of using 
T-junctions as a deplfa cue for vision has been known for a long time, computational methods 

10 for detecting T-junctions in video and use of T-junctions for automatic depth extraction have 
had very limited success so &r. 

Previous research into the use of T-junction has mainly focussed on the T- 
junction detection task and example of schemes for detecting T-junctions are given in 
"Filtering, Segmentation and Depfli" by M. Nitzber& D. Mumford and T.Shiota, 1991. 

15 Lecture Notes in Computer Science 662, Springer-Verlag, Berlin; "Steerable-scalable kernels 
for edge detection and junction analysis" by P. Perona, 1992. 2nd European Conference of 
Computer Vision pages 3-18 linage and Vision Computing, vol. 10, pag. 663-672 and 
"Junctions: Detection, Qassification, and Reconstruction", L. Parida, D. Geiger, R. Hummel, 
1998 IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, no.7, pp.687- 

20 698. 

Furthermore, the document "Filtmng, Segmentation and Depth" by M. 
Nitzberg, D. Mumford and T.Shiota discloses a system for determining depth information 
based on T-junctions. This approach is based on determination of contours and require non- 
linear filtering, curve smoothing, comer and junction detection and curve continuation. 
25 Hence, the desoibed method is very con^lex and requires significant computational resource 
but does provide for extraction of depth information fiom static characteristics. 

However, although significant research has.been undertaking in the field of 
object processing for depth information, the accuracy and reliability of the extracted depth 
information is currentiy not as good as desired. 
30 Furthermore, as the individual processes rely on detection of specific features 

of an image (e.g. T-junctions or comer points of an object), the accurate and reliable 
detection of these features is critical and is forthermore con^utationally demanding. 

Hence, an inqiroved system for object processing for at least one image would 
be advantageous and in particular a sj^item for object processing for depth information 
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allowing for reduced complexity, reduced computatioiial burden and/or ioiproved 
performance would be advantageous. 

SUMMARY OF THE INVENTION 
5 Accordingly, the Invention preferably seeks to mitigate, alleviate or eliminate 

one or more of the above mentioned disadvantages singly or in any combination. 

The inventors of the current invention have realised that improved 
perfomiance in object processing and in particular object processing for depth information 
may be achieved by combining different processes and in particular by combining processes 

10 based on dynamic characteristics wi& processes based on static characteristics. Furthermore, 
the inventors have realised that these jnocesses may be based on different features of the 
image &r optimal performance. 

Accordingly there is provided according to a first aspect of the invention, a 
mefliod of object processing fi>r at least one image conipising the steps of: detecting a 

IS plurality of image points associated with at least one object of the at least one image; 

grouping the plurality of image points into at least a grortp of object points and a group of 
junction points; and individually processing the image points of the group of object points 
and die group of junction points. 

The invention allows for detected image points being grouped according to 

20 whether they are object points or junction points. Object points may be advantageously used 
for detertnining depth information based on dynamic characteristics and junction pointe may 
advantageously be used for determining depth information based on static characteristics. 
Thus, the invention allows for image points of one or more images to be separated into 
different groi:9>s whichnGiay then be individually processed. Thus the invention allows for 

25 inqnroved performance as object processes may be supplied with tiie optimal image points for 
the specific process. Furthermore, inq>roved performance may be achieved as object points 
and junction points are separated thereby reducing the probability that object points are fed to 
a proce^ requiring jmiction points and vice versa The invention fiirthermore allows for a 
single detection process to be used for detecting image points rather than conq>lex dedicated 

30 processes for detecting only object points and junction points. The simple detection may be 
followed by a simple process which determines whether a given image point is more likely to 
be an object point or a junction point Thus, detection proc^es may be re-used for di£ferent 
types of image points thereby r^uUjng in reduced complexity and reduced computational 
requirements. 
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An object point may typically be a feature of a single object in the image, such 
as a comer or side of the object whereas a junction point typically refers to a relative feature 
between two or more objects, such as an intersection point between two objects wherein one 
occludes Ifae other. 

S According to a feature of the invention, the stq> of individually processing 

comprises determining at least one three dimensional characteristic from at least one two 
dimensional image. Hence, the invention allows for an improved process of determining 3D 
characteristics fiom one or more 2D images. Specifically, &e invention allows for an 
improved and low complexily method for determining depth information which may 

10 ixreferably combine relative and absolute depth information determined in response to static 
and dynamic characteristics respectively. 

Accordmg to another feature of the invention, the plurality of image points is 
fiirther grouped into a group of falsely detected points. This allows for improved performance 
of the individual processiug of the image points as the probability of including fgdsely 

1 S detected points in the processing is reduced. 

According to another feature of the invention, each of the plurality of image 
points is included in only one group selected from the group of object points, the group of 
junction points and the group of &lsely detected points. 

Preferably, each point which has been detected is identified as either an object 

20 point or a junction point or a falsely detected point The individual processing may be 

improved as this may result in an inoieai^ probability that the processing is based only on 
points of the sppropoBto type. For example, applying an object point identification routine to 
apoint may result in an indication that the image point has a probability above 0.5 of it being 
an object point However, applying a junction point identification routine to tiie same image 

25 point may also result in an indication tiiat ibe ima^ point has a probability above 0.5 of it 
being a junction point Therefore, using these routines independentiy may result in the image 
point being used both for object point processing and junction point processing. However, by 
only allocating the image point to the group that it most likely corresponds to, the probability 
of it being included in flie wrong group is reduced. For example, tiie junction point 

30 idratification routine may provide a jjrobability of 0.95 and the object point identification 
routine may provide a probability of 0.5 in which case the image point may be added to the 
group of junction ix>ints. 

According to another feature of tiie invention, the step of individually 
processing comprises qqplying a first process to the group of object points and qaplying a 
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second process to the group of junction pointe. Preferably different processes are applied to 
the object points and tibe junction points. Thus, the jSrst process may be based on or 
particularly suited for processmg object pomts whereas the second process may be based on 
or particularly suited for processing junction points. The first and second process may be 
completely separate. Furthermore, the results of the first and second process may be 
combined to provide improved results over that which can be achieved from each individual 
process. 

According to another feature of flie invention, the first process is an object 
process based on object motion within the at least one image. Object points are particularly 
well suited for processes based on object motion. Specifically, object points are suitable for 
determining or processing the movements of an object and particularly movements of a 3D 
object in a 2D image. Hence improved performance of an object process based on object 
motion may be achieved. The first process may for example be a process for object 
identification, object tracking or depth detection. As a sfpedfic example, the first process may 
be a dynamic occlusion depth detection inocess. 

According to anotha feature of the invention, the first process is a structure 
from motion jirocess. Hence, the invention may allow for inqproved 3D structure information 
to be derived fixm object motion determined from object points. 

According to another feature of the invention, the second process is an object 
process based on a static characteristic within the at least one image. Junction points are 
particularly well suited for determining static characteristics. The second process may for 
example be an object identification proc^. 

According to another feature of the invention, the second process is a process 
for determining a depth characteristic of at least one object of the at least one image. Junction 
points are particularly well suited for processes determining depth information based on static 
characteristics and specifically relative depth information between different objects may be 
determined. Thus inojxroved performance of an object process determining depth inf<mnation 
may be achieved. 

Preferably the first process is a process for determining depth information in 
response to dynamic characteristics associated wifli the object points and the second process 
is a process for determining depth infomiation in response to static characteristics associated 
wifli flie junction points. The depth information derived by the first and second processes is 
preferably combined thereby providing additional and/or more accurate and/or reliable depth 
information. 
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According to another feature of the invention, the depth characteristic is a 
relative depth characteristic indicating a relative depth between a plurality of objects of the at 
least one image. Junction points are particularly suitable for detemuning relative depth 
information. 

5 According to another feature of the invention, the step of detecting the 

plurality of image points comprises applying a curvature detection process to at least a part of 
the at least one image. A curvature detection process is a i>articularly single and efifective 
process for detecting image points but does not dififerentiate between the different types of 
image points. Hence, Ihe invration allows for a low con^lexily, easy to implement detection 
1 0 process having low computational resource requirement to be used while providing good 
performance. 

Preferably the junction points comprise T-junction points corresponding to an 
overlap between two objects of the at least one image. 

According to a second aspect of the invention, there is provided an ^yparatus 

15 for object processing for at least one image conqmsing: means for detecting a plurality of 

imsLge points associated with at least one object of the at least one image; means for grouping 
the plurality of image points into at least a group of object points and a group of jimction 
points; and means for individually processing the image points of the group of object points 
and tiie grotp of junction points. 

20 These and oilier asipects, features and advantages of the invention will be 

sqpparent from and elucidated with reference to the embodiment(s) described hereinafter. 

BRIEF DESCRIPTION OF THE DRAWINGS 

An embodiment of the invention will be described, hy way of example only, 
25 with reference to the drawings, in which 

FIG. 1 illustrate an example of a 2D image conotprising two objects; 
FIG. 2 illustrates an apparatus for object processing of one or more images in 
accordance with a preferred embodiment of the invention; 

FIG. 3 illustrates a flow chart of a method of object processing in accordance 
30 with an embodiment of the invention; and 

FIG. 4 illustrates an example of a T-junction in an image 
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DESCRIPTION OF PREFERRED EMBODIMENTS 

The following description focuses on an embodiment of fhe invention 
ai>plicable to object processes fi>r detennining depth infomiation firom a two-dimensional 
image. However, it wiQ be appreciated that fhe invention is not limited to this ai)plication but 
5 may be applied to many other object processes including object detection or object imaging 
processes. 

The possibility of extracting depth information from two dimensional images 
(including images of video sequences) is attracting increasing attention and promises to 
provide enhanced functionality of image applications as well as enabling new applications. 

10 For example, extraction of depth information may enable three dimensional video images to 
be g^erated from conventional two dimensional video. 

Typically methods for extracting depth information conqntise detecting depth 
information in resfponse to movement of objects within images. Thus otgect points 
corresponding to specific points of an object are tracked between images and these object 

IS points are used to determine a 3D model of tiie object One such method determines structure 
jfrom motion of an object 

FIG. 1 illustrates an example of a 2d image comprising two objects. 
Specifically, the image comprises a first cube 101 and a second cube 103. In Ihe structure 
from motion process, first object points 105 correspondmg to the comers of the first cube 101 

20 are used to determine a 3D model of the first cube 101. Similarly, second object points 107 
corresponding to the comers of the second cube 103 are used to determine a 3D model of the 
second cube 103. Parameters of the 3D models are determined such that the comer points 
when projected on to a 2D representation perform the movement observed of the comer 
points in the 2D image. 

25 Thus, processes such as the structure fixnn motion process require tiiat comer 

points of objects are detected. An exan^le of a detector which may be used for detection of 
object comers is given in M. Pollefeys, R. Koch, M. Vergauwen and L. van Gool, *Tlexible 
acquisition of 3D stmcture from motion^, Proc, BEBE IMDSP Workshop, pp. 195-198, 1998. 
This detectcn:, as most olher known detectors, relies e^qplicitiy or implicMy on detecting 

30 comer points based on curvature properties, i.e. on abnqrt variations in a parameter such as 
brightness or colour. 

A disadvantage of these detectors is that they not only detect comer points but 
also detect many other image points. For exantple, the detectors may &lsely detect points 109 
which do not correspond to a comer of an object but which coinddentally have properties 
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which meet the detector criteria. FurfhOTnore, other points such as junction points and 
specifically T-junction points 111 may also show abmpt changes in image parameters and 
accordingly be detected by the comer detector. However, for processes such as the stracture 
firom motion, it is essential that only real fixed object points are used to determine the 3D 
5 model. For example, considering the fidsely detected points 1 09 or the junction points 1 1 1 as 
comer points will distort the 3D model that can be derived (or prevent one fixmi being 
derived). 

Accordingly it may be necessary to use a significantly more complex detector 
which has significantly reduced probability of detecting unwanted image points. However, 

10 this requires very comyplex processing and results in an increased computational burden. 

Alternatively or additionally, tiie valid object points must be extracted fix>m 
the detected image points. This may be achieved by deriving 3D models wherein image 
points Ifaat do not fit in are discarded. However, this is not only a very complex process but 
also has a high probability of erroneously discarding valid object points or included 

1 5 unwanted object points. 

The inventors of the current invention have realised that rather than simply 
extracting object points (such as comer points), improved performance may be derived by 
dividing image points detected by an image point detector into groups of image points of 
different categories. Specifically, the inventors have realised that junction points noiay be 

20 individually processed and may advantageously be used to derive depth information 
improving or supplementing the depth information derived from the object points. 

The invCTtors have realised that dividing the detected image points into at 
least a group of junction points and a group of object points will &cilitate the detection 
process and allow for a simple c<nnmon detection algorithm to be used both for detection of 

25 image points for an object point process and for a junction i)oint process. Hence, a simplified 
detection is achieve with reduced complexity and conoputational burden. In addition, 
improved performance may be achieved of flie individual processes as the probability of 
unwanted image points ^roneously being used in a given process is desired. Specifically, it is 
typically more reliable to detect whether an image point is more likely to be an object point 

30 or a jimction point than it is to determine whether a given point is an object point or not 

Furthermore, the inventors have realised that rather than discarding the junction points these 
may be processed independently and the results possibly combined witii Ifaose r^nilting fix)m 
tiie object point proems Iheroby iiqproving the overall performance of a depth information 
processing. 
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HG. 2 illustrates an apparatus 200 for object processing of one or preferably 
more images in accordance with a {Mreferred embodiment of the invention. 

The ^paratus comprises a detector 201 which receives images and performs 
an image detection process which detects both object points and junction points. Thus, the 
detector 201 detects a plurality of image points associated with at least one object of the 
image(s). For example, an arbitrary curvature detector that finds both object points and T- 
junctions without discriminating between these may be used 

The detector 201 is coupled to a grouping processor 203 which is cperable to 
group the plurality of image points into at least a group of object points and a group of 
junction points. In the preferred embodiment, the image points may &rther be grouped into a 
group of felsely detected points i.e. image points which are considered to be neither object 
points nor junction pomts. 

The grouping processor 203 is coupled to an object pomt store 205 wherein 
the detected object points are stored and a junction point store 207 wherein the detected 
junction points are stored The falsely detected points are singly discarded 

The object point store 20S and junction point store 207 are connected to a 
processor arrangement 209 which is operable to individually process the image points of the 
group of object points and the group of junction points. In the preferred embodiment, the 
processor arrangement 209 comprises an object point processor 211 coupled to the object 
point store 205 and operable to process the stored object points. Specifically, the object point 
proctor 21 1 may perform a depth information process such as the structure firom motion 
process. The processor arrangement 209 further conq>rises a junction point processor 213 
coupled to the junction point storo 207 and operable to process the stored junction points. 
Specifically, the junction point processor 213 may perform a depth information process based 
on T-junctions. 

The object point processor 21 1 and junction point processor 213 are in the 
preferred enlbodiment coupled to a combine processor 215 which combines the depth 
information generated by the individual jirocesses. For example, a depth map for the image 
may be generated wherein the relative depth relationships between oligects are determined 
predonmianfly on the basis of the information fi'om the junction point processor 213 whereas 
depth characteristics of the individual object is determined predominantly on the basis of the 
information fix>m the object point processor 211. 

It will be appreciated that although the above descrq>tion for clarity has 
referred to differont processors tiiese should only be considerod as fimctional modules rather 
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than as physical entities. Thus, tiie different processes may for example in the preferred 
embodiment be performed by a single digital signal processor. 

FIG. 3 illustrates a flow chart of a method of object processing in accordance 
with a preferred embodiment of the invention. 
S The method initiates in step 301 wherein a plurality of image points associated 

with at least one object of the at least one image is detected. In the preferred embodiment, a 
curvature detection process is q)plied to tihe whole or to at least a part of one or more images. 

For example, the detection algorithm described in M. PoUefeys, R. Koch, M. 
Vergauwen and L. van Gool, 'Tlexible acquisition of 3D structure from motion", Proc. lE^ 
10 IMDSP Woricshop, pp. 195-198, 1998 naay be used. Alternatively or additionally, a detection 
based on segmentation of an image may be used. 

Step 301 is followed by step 303 wheiem the plurality of image points are 
grouped into at least a group of object points and a ffoxsp of junction points and preferably 
into a group of falsely detected points. In tiie preferred embodiment, each of the plurality of 
15 image points is included in only one group selected from the group of object points, the 

gioiq) of junction points and the group of falsely detected points. Thus in step 303, each of 
the detected image points is evaluated and put into one and only one group. In otiier words, 
each image point us characterized as either an object point or a junction point or a &lsely 
detected point. 

20 Furthermore, the object points are grouped into sets of object points belonging 

to an individual object. SpedficaUy, the grouping of image points into olg^ 
grouping into sets corresponding to each object may be done using the process described in 
DJP. McReynolds and D.G. Lowe, '"Rigidity checking of 3D point corr^pondences under 
perspective projection", IEEE Trans, on PAMI, Vol. 18, No. 12, pp. 1 174-1 185, 1996. 

25 Li this case, the grouping is based on the fact that all points belonging to one 

moving rigid object will follow the same 3D motion model. Thus, junction points and fidsely 
detected points which do not follow any motion model are not considered obj^ points. 

The remaining points are subsequentiy processed to extract junctions. As a 
specific example, the image may be divided into a number of segments corresponding to 

30 disjoint regions of the image. The aim of image segmentation is to groi^) pixels together into 
image segments which are unlikely to contain depth discontinuities. A basic assunoption is 
that a depth discontinuity causes a shaip change of brightness or colour in the image. Pixels 
with similar brig^ln^s and/or colour are therefore groined together resulting in 
brightness/colour edges between regions. 
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In one embodiment, the segmentation cominises grouping picture elements 
having similar brightness levels in the same image segment. Contiguous groups of pictme 
elements having similar brightness levels tend to belong to the same underl3dng object 
Similarly, contiguous groups of picture elements having similar colour levels also tend to 
5 belong to the same underlying object and the segmentation may alternatively or additionally 
comprise grouping picture elements having similar colours in the same segment 

The segmentation process is in the preferred embodiment part of the detection 

process. 

In the preferred embodiment, the T-junctions are identified by analysing all 
10 2x2 sub-matrices of liie segmentation matrix. Since the T-junctions are to be detected, the 
analysis focuses on 3-junctions which are junctions at which exactiy three different image 
segments meet 

In order to extract 3 -junctions from the segmentation matrix, liie structure of 
all possible 2x2 sub-matrices is examined. A sub-matrix contains a 3-junction if exactiy one 
IS of the four differences 

is equal to zero. This is for example the case for the following sub-matrices: 

[: Ui :]•[; ;]•[; 'i 

but not for example for the following sub-matrix. 




25 

This sub-matrix is not considered to be a 3-junction because region nuniber 1, 
which occurs twice, is not 4-connected. This violates the basic assunoption tiiat regions in the 
segmentation must be 4-connected on a square sanq>ling grid. 

In other words, a 2 by 2 sub-matrix is considered a 3-junction if the four 
30 elements correspond to exactly tiiree image segmrats and the two samples fix>m the same 
image segments are next to each other either vocally or horizontal^ (Jbiit not diagonally). 
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It should be noted that a 3-junction is not necessarily a T-junction, but may 
also indicate a fork or an arrow shape (which may for example occur in the image of a cube). 
A further geometric analysis is therefore needed to determine whether a detected 3-junction 
maybe considered a T-junction. However, as such a geometric analysis has already been 
5 perfimned to extract the object points, tiie remaining points meeting the above criteria may 
be considered T-junction points. All other points are discarded as &lsely detected points. 

Step 303 is followed by step 305 wherein the image pointe of the groiqp of 
object points and the group of junction points are individually processed. In tibe preferred 
embodiment the individual processing is aimed at determining at least one three dimensional 
10 chaiacteristic from 2D images based on object points and junction points respectively. 

In the preferred embodiment, separate processes are applied to the different 
groups of object points. Thus, the individual proc^sing coniprises applying a first process to 
the group of object points and applying a second process to the group of junction points. 

In the preferred embodiment, the first process is an object proems which is 
15 based on object motion within tiie at least one image. The first process may for example be a 
process for determining 3D characteristics based on the movement of object points within a 
sequence of images. The process may for example be a (fynamic occlusion jxrocess but is in 
the preferred embodiment a structure fixnn moticm process. Thus, a 3D model of objects in 
the imagq noay be derived based on the movement of the corresponding object points. 
20 In Ihe preferred embodiment, Ihe second process is an object process based on 

a static characteristic of tiie image, and is spedfically a process for determining a depth 
characteristic of an object in the image. Thus, in tiie preferred embodiment object points are 
used to determine depth information based on dynamic characteristics whereas junction 
points are used for determining deptii information based on static characteristics. 
25 The second process may be a process for determining depth information in 

accordance with the approach described in "Filtering, Segmentation and Depth" by M. 
Nitzberg, D. Mumford and T.Shiota, 1991 . Lecture Notes in Computer Science 662. 
Spring^-Verlag, Berlin. 

FIG. 4 illustmtes an exan^le of a T-junction in an image and illustrate how 
30 deptii information may be found fix>m a T-junction. In the illustrated example, the image 
comprises a first rectangle 401 and a second rectangle 403. The first rectangle 401 overl^s 
the second rectangle 403 and accordingly edgps form an intersection known as a T-junction 
405. Specifically, a first edge 407 of the second rectangle 403 is cut short by a second edge 
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409 of fhe jQrst lectangle. Accoidingly, the fiist edge 407 forms a stem 41 1 of the T-jimctiorn 
405 and the second edge 409 fomis a top 413 of the T-junction. 

Thus, in the example Hie T-junction 405 is the point in the image plane where 
the object edges 407, 409 form a 'T" with one edge 407 terminating on a second edge 409. 
5 Humans are capable of identifying that some objects are nearer than others just by the 
presence of T-junctions. In the example of FIG. 4, it is clear that the first rectangle 401 
occludes the second rectangle 403 and thus that the object corresponding to the first rectangle 
401 is in firont of the object corresponding to the second rectangle 403. 

Hence, by determining a top and a stem of the T junction, relative depth 
10 information between objects may be determined. Identification of the top and stem is used in 
deriving a possible depth order. To identify the top and the stenti, it is in the preferred 
embodiment assumed that both are straight lines which pass through the junction point, but 
with an arbitrary orientation angle. Accordingly, the junction is fitted to first and second 
curves, which in the preferred embodiment are straight lines, and the regions forming the 
1 5 stem and the top are determined in response thereto. 

As is clear firom FIG. 4, the image section which forms the top but not tiie 
stem is inherentiy in front of the image sections forming the stem. Depth information 
between the two image sections forming the stem cannot directiy be derived firom the T- 
junction. In the preferred embodiment, many T-junctions are determined and specifically a 
20 given object may have many ccmresponding T-junctions. Therefore, relative depth 

information may be determined by considering the relative depth information of all objects 
and specifically a depth m^ representing the relative depth of objects in images may be 
derived. 

The depth information based on the dynamic performance of the object points 
25 may be combined with the relative depth information based on the static characteristics of the 
T-junctions thereby enhancing and/cn: improving the generated depth infimnation. 

The invention can be implemented in any suitable fi>rm including hardware, 
sofiwace, firmware or any combination of these. However, preferably, the invention is 
implemented as computer software running on one or more data processors and/or digital 
30 signal processors. The elements and components of an embodiment of the invention may be 
physicalty, ftmctionally and logically inq>lemented in any suitable way. Indeed the 
fimctionality may be ircplemented in a single unit, in a plurality of units or as part of other 
functional units. As such, the invention may be implemented in a single unit or may be 
physically and functionally distributed between different units and processors. 
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Although the present invention has been described in connection with the 
preferred embodiment, it is not intended to be limited to the specific form set forth herein. 
Rather, the scope of the present invention is Hmited only by the accontpanying claims. In the 
claims, the term comprising does not exclude the presence of other elements or steps. 
5 FurHiennore, altiiough individually listed, a pluraUly of means, elements or method steps 
may be implemented by e.g. a single unit or processor. Additionally, although individual 
features may be included in difiEerent claims, these may possibly be advantageously 
combined, and the inclusion in different claims does not imply that a combination of features 
is no feasible and/or advantageous. In addition, singular references do not exclude a plurality. 
10 Thus references to "a", "an", "first", "second" etc do not preclude a plurality. 
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CLAIMS: 



1. A method of object processing for at least one image comprising the steps o£ 

detecting (301) a plurality of image points (105, 107, 109, 111) associated 

with at least one object of the at least one image; 

grouping (303) flie plutalily of image points (105, 107, 109, 1 1 1) into at least a 

group of object points (105, 107) and a group of junction points (1 1 1); and 

individually (305) processing the image pomts of the group of object points 

(105, 107) and the group of junction pomts (1 1 1). 

2- ^ method of object processing as claimed in claim 1 wherein the step (305) of 

individuaUy processing comprises determining at least one three dimensional characteristic 
fiom at least one two dimensional image. 

2- A method of object processing as claimed m claim 1 wheiem the phirality of 

image points (105, 107, 109, 1 1 1) are further grouped into a group of filsely detected points 
(109). 

4. A method of object processing as claimed in claim 3 wherem each of flie 
pluraHly of image points (105, 107, 109, 1 1 1) is inchided in only one group selected fiom the 
group of object points (105, 107), the group of junction points (1 1 1) and the group of felsely 
detected points (109). 

5. A mefliod of object processing as claimed in claim 1 wherein the step (305) of 
individuaUy processing comprises applying a first process to die group of object points (105, 
107) and flying a second process to the group of junction points (1 1 1). 

6. A method of object processing as claimed in claim 5 wherem the first process 
is an object process based on object motion within the at least one image. 
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7. A method of object pKK»ssmg as claimed in claim 5 wheie^ 
is a stracture from motion process. 

8. A meliiod of object processing as claimed in claim 5 wherein the sectmd 
process is an object process based on a static characteristic within the at least one image. 

9. A method of object processing as claimed in claim 5 wherein the second 
process is a process for determining a depth characteristic of at least one object of the at least 
one image. 

10. A method of object processmg as claimed in claim 9 wherein flie depth 
characteristic is a relative depth characteristic indicating a relative depfli between a plurality 
of objects of the at least one ima^. 

11. A method of object processing as claimed in claim 1 wherem the step of 
detecting (301) the pluraHty of image points (105, 107, 109, 1 1 1) comprises applying a 
curvature detection process to at least a part of the at least one image. 

12. A method of object processing as claimed in claim 1 wherein tiie junction 
poinb (1 1 1) comprise T-junction points (111) conresponding to an overlap between two 
objects of tiie at least one image. 

13. A conq)uter program enabling tiie carrying out of a method according to claim 
1. 

14 A record canier comprising a conqputer program as claimed in claim 13. 

1 5_ An ^aratus fi>r object proce^ing for at least one image comprismg: 

means (201) for detecting a plurality of image points (105, 107, 109, 111) 

associated with at least one object of the at least one image; 

means (203) forgrovrpingtheplurality of image points (105, 107, 109, 111) 

into at least a ffoup of object points (105, 107) and a gcoup of junction points (111); and 
means (209) for individually processing tiie image points of the groiq) of 

object points (105, 107) and the group of junction points (1 1 1). 
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ABSTRACT: 



An apparatus (200) comprises a simple detector (201) jfor detecting a plvirality 
of image points (105, 107, 109, 1 1 1) associated with at least one object of the at least one 
image. The detector does not differentiate between different types of image points. The 
detector (201) is coupled to a giotiiMng processor (203) which groups iiie plurality of image 
5 points (105, 107, 109, 1 1 1) into a groi^) of object points (105, 107), a group of junction 
points (1 1 1) and a group of felsely detected points (109). The apparatus finther comprises a 
processor arrangement 209 fixr individual^ processing tiie image points of the group of 
object points (105, 107) and tiie gxnap of junction points (1 1 1). The object point process may 
generate depth information based on dynamic characteristics and the jimction point process 
10 may generate depth information based on static cSiaractetistics. Improved deirth information 
may thus be achieved and a simplified detector may be employed. 

FIG. 2 
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